AITopics | standard normal distribution

NSNQuant: A Double Normalization Approach for Calibration-Free Low-Bit Vector Quantization of KV Cache

Neural Information Processing Systems | Jun-16-2026, 15:55:42 GMT

Large Language Model (LLM) inference is typically memory-intensive, especially when processing large batch sizes and long sequences, due to the large size of the key-value (KV) cache. Vector Quantization (VQ) has recently been adopted to alleviate this issue, but we find that the existing approach is susceptible to distribution shift due to its reliance on calibration datasets. To address this limitation, we introduce NSNQuant, a calibration-free VQ technique designed for low-bit compression of the KV cache. By applying a three-step transformation--1) a token-wise normalization (Normalize), 2) a channel-wise centering (Shift), and 3) a second token-wise normalization (Normalize)--with a Hadamard transform, NSNQuant effectively aligns the token distribution with the standard normal distribution. This alignment enables robust, calibration-free vector quantization using a single reusable codebook. Extensive experiments show that NSNQuant consistently outperforms prior methods in both 1-bit and 2-bit settings, offering strong generalization and up to 3× throughput gain over full-precision baselines.
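The three-step transformation described in the abstract can be sketched in a few lines. This is a minimal illustration only, not the authors' implementation: the function names, the placement of the Hadamard rotation before the first normalization, and the choice to scale each token to norm sqrt(d) are all assumptions made for the example.

```python
import math

def hadamard(vec):
    """Orthonormal fast Walsh-Hadamard transform; len(vec) must be a power of two."""
    out = list(vec)
    n = len(out)
    h = 1
    while h < n:
        for start in range(0, n, 2 * h):
            for i in range(start, start + h):
                a, b = out[i], out[i + h]
                out[i], out[i + h] = a + b, a - b
        h *= 2
    scale = 1.0 / math.sqrt(n)
    return [x * scale for x in out]

def normalize_tokens(rows, eps=1e-8):
    """Token-wise normalization: rescale each row (token) to L2 norm sqrt(d)."""
    d = len(rows[0])
    result = []
    for row in rows:
        norm = math.sqrt(sum(x * x for x in row))
        result.append([x * math.sqrt(d) / (norm + eps) for x in row])
    return result

def shift_channels(rows):
    """Channel-wise centering: subtract each column (channel) mean."""
    n, d = len(rows), len(rows[0])
    means = [sum(row[j] for row in rows) / n for j in range(d)]
    return [[row[j] - means[j] for j in range(d)] for row in rows]

def nsn(rows):
    """Assumed ordering: Hadamard rotation, then Normalize -> Shift -> Normalize."""
    rotated = [hadamard(row) for row in rows]
    return normalize_tokens(shift_channels(normalize_tokens(rotated)))
```

After the final step every token has squared norm close to d, so each coordinate has roughly unit second moment, which is the property a codebook fitted to the standard normal distribution would rely on.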

large language model, machine learning, quantization, (20 more...)

Neural Information Processing Systems

Country: Asia (0.46)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.97)

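Polynomially Over-Parameterized Convolutional Neural Networks Contain Structured Strong Winning Lottery Tickets

artificial intelligence, machine learning, pruning, (18 more...)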

Neural Information Processing Systems

Country:

North America > United States (1.00)
Europe (1.00)
North America > Canada > British Columbia (0.28)

Genre:

Contests & Prizes (0.50)
Research Report (0.46)

Industry: Leisure & Entertainment > Gambling (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.82)

Hierarchical Probabilistic Principal Component Analysis of Longitudinal Data

Zhang, Xinyu, Qaqish, Ameer, Lin, D. Y., Li, Didong

arXiv.org Machine Learning | Apr-27-2026

In many longitudinal studies, a large number of variables are measured repeatedly over time, with substantial missing data. Existing methods, such as probabilistic principal component analysis (PPCA), are ill-equipped to handle such incomplete, high-dimensional longitudinal data, as they fail to account for the nested sources of variation and temporal dependency inherent in repeated measures. We introduce hierarchical probabilistic principal component analysis (HPPCA), a two-level probabilistic factor model that explicitly separates between-subject variance from time-varying within-subject dynamics. The within-subject latent factors are modeled by a Gaussian process. We develop an EM algorithm to handle missing data and flexible covariance kernels, accelerated by computationally efficient initializers. Simulation studies demonstrated that HPPCA robustly recovers model parameters subspaces and substantially outperforms both standard PPCA and multivariate functional PCA in imputation accuracy, even under heavy missingness and model misspecification. An application to long COVID symptoms in the Researching COVID to Enhance Recovery (RECOVER) adult cohort revealed that HPPCA effectively captured the data's hierarchical structure and that its learned features significantly improved the prediction of clinical outcomes and the recovery of masked clinical records compared to existing methods.
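One plausible way to write a two-level factor model of the kind the abstract describes is given below. The symbols ($W_b$, $W_w$, $z_i$, $u_i$, $k$, $\sigma^2$) are assumptions introduced for illustration; the paper's actual parameterization may differ.

```latex
% Hypothetical notation: subject i, observation time t, p observed variables.
\begin{aligned}
y_i(t) &= \mu + W_b\, z_i + W_w\, u_i(t) + \varepsilon_i(t), \\
z_i &\sim \mathcal{N}(0, I_{q_b}) && \text{(between-subject factors, constant over time)} \\
u_{i,\ell}(\cdot) &\sim \mathcal{GP}\bigl(0,\, k(t, t')\bigr), \quad \ell = 1, \dots, q_w && \text{(within-subject factors, time-varying)} \\
\varepsilon_i(t) &\sim \mathcal{N}(0, \sigma^2 I_p) && \text{(measurement noise)}
\end{aligned}
```

Under this reading, standard PPCA corresponds to dropping the $W_w\, u_i(t)$ term and treating every visit as an independent observation.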

artificial intelligence, machine learning, matrix, (17 more...)

arXiv.org Machine Learning

2604.22015

Country: North America > United States (0.46)

Genre:

Research Report > New Finding (0.66)
Research Report > Experimental Study (0.66)

Industry:

Health & Medicine > Therapeutic Area > Neurology (1.00)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)
(2 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Principal Component Analysis (0.81)

b9ed18a301c9f3d183938c451fa183df-Supplemental.pdf

Neural Information Processing Systems | Feb-10-2026, 21:46:36 GMT

converge, ratifiable policy, supp, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.69)

3cfc102893d47c46295cb437949dccb5-Paper-Conference.pdf

Neural Information Processing Systems | Feb-10-2026, 17:14:53 GMT

dimension, high dimension, hsic, (17 more...)

Neural Information Processing Systems

Country:

Asia > China > Shanghai > Shanghai (0.05)
Asia > Middle East > Jordan (0.04)
North America > United States (0.04)
(2 more...)

Genre: Research Report (0.68)

Industry: Energy (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

A theory of learning with constrained weight-distribution

Neural Information Processing Systems | Feb-9-2026, 06:00:03 GMT

A central question in computational neuroscience is how structure determines function in neural networks.

artificial intelligence, constraint, machine learning, (17 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)

Genre: Research Report (0.46)

Industry: Health & Medicine > Therapeutic Area > Neurology (1.00)

Technology:

Information Technology > Artificial Intelligence > Cognitive Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.47)

5ef20b89bab8fed38253e98a12f26316-Supplemental.pdf

Neural Information Processing Systems | Feb-8-2026, 14:35:11 GMT

algorithm, logistic regression problem, posterior, (15 more...)

Neural Information Processing Systems

Genre: Research Report (0.30)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.69)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.48)

3cfc102893d47c46295cb437949dccb5-Paper-Conference.pdf

Neural Information Processing Systems | Oct-8-2025, 12:31:01 GMT

dimension, high dimension, hsic, (17 more...)

Neural Information Processing Systems

Country:

Asia > China > Shanghai > Shanghai (0.05)
Asia > Middle East > Jordan (0.04)
North America > United States (0.04)
(2 more...)

Genre: Research Report (0.68)

Industry: Energy (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

A Proofs A.1 Proof of Proposition 1 We first show that for any T T

Neural Information Processing Systems | Oct-3-2025, 00:58:29 GMT

A.2 Proof of Relation (3). We can write D [...] One class of transport maps we consider in our numerical experiments (i.e., to approximate [...] Another underlying class of transports that we use in our numerical experiments is inverse auto-regressive flows (IAFs). IAFs are built as a composition of component-wise affine transformations, where the shift and scaling functions of each component depend only on earlier-indexed variables. Flows typically comprise several IAF stages, with the components either randomly permuted or, as we choose, reversed between stages. [...] Here we discuss how generalized linear models may naturally admit lazy structure. [...] Here we describe the numerical algorithms required by the lazy map framework.
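The IAF construction described in the excerpt (component-wise affine maps whose shift and scale depend only on earlier-indexed variables, with the component order reversed between stages) can be sketched as follows. This is a toy illustration under stated assumptions: the shift and log-scale functions are linear with hypothetical weight matrices `shift_w` and `scale_w`, whereas practical flows use neural networks for them.

```python
import math

def affine_stage(z, shift_w, scale_w):
    """One autoregressive affine stage.

    x[i] = shift_i + exp(log_scale_i) * z[i], where shift_i and log_scale_i
    are (assumed linear) functions of z[0..i-1] only. The Jacobian is lower
    triangular, so its log-determinant is the sum of the log-scales.
    """
    x, log_det = [], 0.0
    for i, zi in enumerate(z):
        shift = sum(shift_w[i][j] * z[j] for j in range(i))
        log_scale = sum(scale_w[i][j] * z[j] for j in range(i))
        x.append(shift + math.exp(log_scale) * zi)
        log_det += log_scale
    return x, log_det

def flow(z, stages):
    """Compose several stages, reversing the component order between stages."""
    x, total_log_det = list(z), 0.0
    for k, (shift_w, scale_w) in enumerate(stages):
        if k > 0:
            x = x[::-1]  # reversal is a permutation, so it adds 0 to the log-det
        x, log_det = affine_stage(x, shift_w, scale_w)
        total_log_det += log_det
    return x, total_log_det
```

Because only entries with `j < i` are read, the first output component never depends on later inputs. Reversing the order between stages lets every component eventually influence every other.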

artificial intelligence, machine learning, posterior, (17 more...)

Neural Information Processing Systems

Genre: Research Report (0.30)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.48)

A Q-value convergence We here show that if a tabular agent converges to a policy π in a continuous NDP then Q

Neural Information Processing Systems | Aug-17-2025, 02:23:12 GMT

See Singh et al. (2000). Moreover, SARSA and Expected SARSA are also both appropriate, if the agent is greedy in the limit. Note that condition 2 requires that the agent takes every action in every state infinitely many times. Proof. Let A satisfy the following in a given NDP: A is greedy in the limit, i.e. for all δ > 0, P (Q [...] A's Q-values are accurate in the limit, i.e. if π [...] Then φ has a fixed point. Theorem 3. Every continuous NDP has a strongly ratifiable policy.
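For readers unfamiliar with the algorithms named in the excerpt, the sketch below shows a generic tabular Expected SARSA update under an epsilon-greedy policy. It is a textbook-style illustration, not the paper's NDP setting: the function names, the discount factor `gamma`, and the step size `alpha` are assumptions for the example. Decaying `epsilon` toward zero over time is one common way to make such an agent greedy in the limit while still trying every action infinitely often.

```python
def epsilon_greedy_probs(q_values, epsilon):
    """Action probabilities of an epsilon-greedy policy over one state's Q-values."""
    n = len(q_values)
    best = max(range(n), key=lambda a: q_values[a])
    probs = [epsilon / n] * n
    probs[best] += 1.0 - epsilon
    return probs

def expected_sarsa_update(q, state, action, reward, next_state,
                          alpha, gamma, epsilon):
    """Move q[state][action] toward reward + gamma * E_pi[Q(next_state, .)]."""
    probs = epsilon_greedy_probs(q[next_state], epsilon)
    expected_next = sum(p * v for p, v in zip(probs, q[next_state]))
    target = reward + gamma * expected_next
    q[state][action] += alpha * (target - q[state][action])
    return q[state][action]
```

Plain SARSA differs only in replacing the expectation with the Q-value of the single action actually sampled in `next_state`.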

artificial intelligence, converge, machine learning, (15 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.69)
